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We present a new approach of topology biased random walks for undirected networks. We focus 
on a one parameter family of biases and by using a formal analogy with perturbation theory in 
quantum mechanics we investigate the features of biased random walks. This analogy is extended 
through the use of parametric equations of motion (PEM) to study the features of random walks vs. 
parameter values. Furthermore, we show an analysis of the spectral gap maximum associated to the 
value of the second eigenvalue of the transition matrix related to the relaxation rate to the stationary 
state. Applications of these studies allow ad hoc algorithms for the exploration of complex networks 
and their communities. 
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INTRODUCTION 



The study of complex networks has notably increased 
in the last years with applications to a variety of fields 
rangingfrom computer science [l| and biology to social 
science[3J3 and finance Q. A central problem in network 
science [g, [D] is the study of the Random Walks (RW) on 
a graph, and in particular of the relation between the 
topological properties of the network and the properties 
of diffusion on it. This subject is not only interesting 
from a purely theoretical perspective, but it has impor- 
tant implications to various scientific issues ranging from 
epidemics Q to the classification of web pages through 
PageRank algorithm @. Finally, RW theory is also used 
in algorithms for community detection 

In this paper we set up a new framework for the study 
of topologically biased random walks on graphs. This al- 
lows to address problems of community detection and 
synchronization |15| in the field of complex networks 
[l6L [T7I ] . In particular by using topological properties 
of the network to bias the RWs we explore the network 
structure more efficiently. A similar approach but with 
different focus can be found in [181 ] . In this research we 
are motivated by the idea that biased random walks can 
be efficiently used for community finding. To this aim 
we introduce a set of mathematical tools which allow us 
an efficient investigation of the "bias parameters" space. 
We apply this tools to uncover some details in the spectra 
of graph transition matrix, and use the relation between 
spectra and communities in order to introduce a novel 
methodology for an efficient community finding. The pa- 
per is organized as follows: in the first section we define 
the topologically biased random walks (TBRW) . We then 
develop the mathematical formalism used in this paper, 
specifically the perturbation methods and the parametric 
equations of motion, to track the behaviour of different 



biases. In the second section we focus on the behavior of 
spectral gap in biased random walks. We define the con- 
ditions for which such a spectral gap is maximal and we 
present numerical evidence that this maximum is global. 
In the third section we present an invariant quantity for 
the biased random walk; such constant quantity depends 
only upon topology for a broad class of biased random 
walks. Finally, in the fourth section we present a gen- 
eral methodology for the application of different TBRW 
in the community finding problems. We then conclude 
by providing a short discussion of the material presented 
and by providing an outlook on different possible appli- 
cations of TBRW. 



BIASED RANDOM WALKS 

RWs on graphs are a sub-class of Markov chains ■ 
The traditional approach deals with the connection of the 
unbiased RW properties to the spectral features of tran- 
sition operators associated to the network (2pj . A generic 
graph can be represented by means of the adjacency ma- 
trix A whose entries Aij are 1 if an edge connects vertices 
i and j and otherwise. Here we consider undirected 
graphs so that A is symmetric. The normal matrix T 
is related to A through T = Ak , where k is a diag- 
onal matrix with {k)n = ki, i.e. the degree, or number 
of edges, of vertex i. In the following we use uppercase 
letters for non-diagonal matrices and lowercase letters for 
the diagonal ones. Note that by definition kj = J^i^ij- 
Consequently Y] f Tjj — 1 with Ty 7^ iif = 1, i.e. if 
i and j are nearest neighbors vertices. The matrix {T^} 
defines the transition probabilities for an unbiased ran- 
dom walker to pass from j to i. In such a case Xy has the 
same positive value for any of the neighbors i of j and 
vanishes for all the other vertices [21]. In analogy to the 
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operator defining the single step transition probabilities 
in general Markov chains, T is also called the transition 
matrix of the unbiased RW. 

A biased RW on a graph can be defined by a more 
general transition matrix T where the element Tij gives 
again the probability that a walker on the vertex j of 
the graph will move to the vertex i in a single step, but 
depending on appropriate weights for each pair of vertex 
A genuine way to write these probabilities is to 
assign weights Wij which represent the rates of jumps 
from vertex j to vertex i and normalize them: 



T; 



Wi, 



(i) 



In this paper we consider biases which are self- 
consistently related to graph topological properties. For 
instance Wij can be a function of the vertex properties 
(the network degree, clustering, etc.) or some functions 
of the edge ones (multiplicity or shortest path between- 
ness) or any combination of the two. There are other 
choices of biases found in the literature such as for in- 



stance maximal entropy related biases [22J. Some of the 
results mentioned in this paper hold also for biases which 
are not connected to graph properties as will be men- 
tioned in any such case. Our focus on graph properties 
for biases is directly connected with application of biased 
random walks in examination of community structure in 
complex networks. 

Let us start by considering a vertex property Xi of the 
vertex i (it can be either local as for example the degree, 
or related to the first neighbors of i as the clustering co- 
efficient, or global as the vertex betweenness). We choose 
the following form for the weights: 



Wu = A,. 



(2) 



where the parameter f3 £ R tunes the strength of the bias. 
For j3 — the unbiased case is recovered. By varying 
(3 the probability of a walker to move from vertex j to 
vertex i will be enhanced or reduced with respect to the 
unbiased case according to the property Xi of the vertex 
i. For instance when Xi = ki, i.e. the degree of the vertex 
i, for positive values of the parameter /3 the walker will 
spend more time on vertices with high degree, i.e. it will 
be attracted by hubs. For /3 < it will instead try to 
"avoid" traffic congestion by spending its time on the 
vertices with small degree. The entries of the transition 
matrix can now be written as: 



(3) 



For this choice of bias we find the following results: (i) 
we have a unique representation of any given network 
via operator T(x,/3), i.e. knowing the operator, we can 
reconstruct the graph; (ii) for small we can use pertur- 
bation methods around the unbasied case; (iii) this choice 



of bias permits in general also to visit vertices with van- 
ishing feature x, which instead is forbidden for instance 
for a power law W ~ x a ; (iv) this choice of biases is 
very common in the studies of energy landscapes, when 
biases represent energies Xi = E% (see for example p3j 
and references therein). 

In a similar way one can consider a symmetric edge 
property yij (for instance edge multiplicity or shortest 
path betweenness) as bias. In this case we can write the 
transition probability as: 



(4) 



The general case of some complicated multiparameter 
bias strategy can be finally written as 



J2 lAlj e^^ )+ ^^f 



(5) 



While we mostly consider biased RW based on vertex 
properties, as shown below, most of the results can be 
extended to the other cases. The transition matrix in the 
former case can also be written as: T{x,j3) = wAz^ 1 , 
where the diagonal matrices w and z are such that Wu = 
e^ Xi and z# = 1/ J2i Aue^ xi . The Frobenius-Perron 
theorem implies that the largest eigenvalue of T(x,(3) 
is always h((3) = 1 Furthermore, the eigenvector 

V\ associated to Ai is strictly positive in a connected 
aperiodic graph. Its normalized version, denoted as p(/3), 
gives the asymptotic stationary distribution of the biased 
RW on the graph. Assuming for it the form Pi((3) — 
r!(/3)~ 1 5l (/3)z i (/3), where z t = £^e^ and Q(/3) > 
is a normalization constant, and plugging this in the 
equation p — Tp we get: 
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(6) 



Hence the equation holds iif gi = e l3xi . Therefore the 
stable asymptotic distribution of vertex centerd biased 
RWs is 



Piifi) = «(/3) 



ZiW) • 



(7) 



For /3 = we have the usual form of the stationary 
distribution in an unbiased RW where ^(0) = fcj and 
O(0) = J2i^i- For general (3 it can be easily demon- 
strated that the asymptotic solution of edge biased RW 



is pi = Q 



while for multiparamctric RW the solution 



is Pi = Q^e^^n Zi. 

Using Eqs. and ([3]) we can prove that the detailed 
balance condition TijPj = TjiPi holds. 
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At this point it is convenient to introduce a different 
approach to the problem [l(]| . We start by symmetrizing 
the matrix T(x,(3) in the following way: 



(8) 



where p(/3) is the diagonal matrix with the stationary 
distribution {pi((3)} on the diagonal. The entries of the 
symmetric matrix for vertex centered case are given by 



T?Ax,0)=T! i (x,P) = A i 



(9) 



The symmetric matrix T (x,j3) shares the same eigenval- 
ues with the matrix T(x,f3); anyhow the set of eigenvec- 
tors is different and forms a complete orthogonal basis, 
allowing to define a meaningful "distance" between ver- 
tices. Such distance can provide important additional 
information in the problem of community partition of 
complex networks. If v„ is the v th eigenvector of the 
asymmetric matrix T(x,f3) associated to the eigenvalue 
A„(/3) (therefore vi = p), the corresponding eigenvec- 
tor \v v ) of the symmetric matrix T (x,/3), can always be 
written as \v u )i — In particular for v = 1 we have 

\vi)i = \p)i = yfpi. The same transformation (j8]) can be 
applied to the most general multiparametric RW. In that 
case the symmetric operator is 



T°Jx,l3)=TUx,p)=A l 



(10) 

This form also enables usage of perturbation theory 
for Hermitian linear operators. For instance, knowing 
the eigenvalue X v (/3) associated to eigenvector \v v (/?)), 
we can write the following expansions at sufficiently small 
A/3: \ v 03 + A/3) = \ v (/3) + A/3A^ (/?) + ... and M/3 + 
A/3)) = \v ( °\p)) + A/3|^ 1} (^)) + . . .. It follows that for 
a vertex centered bias 



\W(!3) = {v v (p)\f s(1 \x,p)\v„(P)} 



(11) 



where, 



f s{1 \x,py. 



df (x,p) 
dp 



{Z,T }+ 



-{£(/3),T }_ 



(12) 

with {•,•}+ being the anticommutator operator. Opera- 
tor x and x(/3) are diagonal matrices with (x)a — Xi and 
Xu(f3) = Auxie^ Xl J z{i) which is the expected value 
of x that an random walker, will find moving from vertex 
i to its neighbors. In the case of edge bias the change 
of symmetric matrix with parameter /3 can be written as 

= F *T*(/3) - l/2{y,f S (J3)}+, and * represents 
the Schur-Hadamard product i.e. element wise multipli- 
cation of matrix elements. The eigenvector components 



in (3 + A/3 at the first order of expansion in the basis of 
the eigenvectors at /3 are given by (for p ^ v): 



^(/3)|T s(1) (/3)M/3)) 
A„G9)-A„G8) 



(13) 



For fi = v the product (v ^(fi^v^ ((3)} vanishes and 
eqs. (|12p and ([13")) hold only for non-degenerate cases. 
In general, usual quantum mechanical perturbation the- 
ory can be used to go to higher order perturbations or to 
take into account degeneracy of eigenvalues. 

We can also exploit further the formal analogy with 
quantum mechanics using Parametric Equations of Mo- 
tion (PEM) [Hi HI] to study the /3 dependence of the 

* s 

spectrum of T . If we know such spectrum for one value 
of /3, we can calculate it for any other value of /3 by 
solving a set of differential equations corresponding to 
PEM in quantum mechanics. They are nothing else the 
expressions of Eqs. (jTTJ) and ([T3"|) in an arbitrary com- 
plete orthonormal base {!</>;,)}. First the eigenvector is 
expanded in such a base: \v u {[3)) — \ v v(P)) = 

Y^t c u ^(f3)\4>^) . We can then write 



d/3 AP) 



df a " 



dp 



(14) 



where c v (cj) is a column (row) vector with entries 

c^(/3) and T ' is the matrix with entries T^(j3) = 

{cj) v \T S (f3)\(f>^). Let us now define the matrix N((3) whose 
rows are the copies of vector cj (/3) . The differential equa- 
tion for the eigenvectors in the basis {|0i/)} is then [25| 



9/3 



(f s ' (/3)-A,(/3)+iV(/3) 



' df s '\B) 



df s '\(S) 
dp 



(15) 



A practical way to integrate Eqs 
be found in 



25|. 



matrix ( f'^^P) 



([Til) and C3 can 

In order to calculate parameter de- 
pendence of eigenvectors and eigenvalues, the best way 
to proceed is to perform an LU decomposition of the 

Ai/(/3) + N(fi)j as the product of a 

lower triangular matrix L and an upper triangular ma- 
trix [/, and integrate differential equations of higher or- 
der which can be constructed in the same way as equa- 
tions (fT4j) and (|15|) [251 ]. A suitable choice for the basis 
is just the ordinary unit vectors spanned by vertices, i.e. 
{|</))} = {|e)}. We found that for practical purposes, de- 
pending on the studied network, it is appropriate to use 
PEM until the error increases to much and then diag- 
onalize matrix again to get better precision. PEM effi- 
ciently enables study of the large set of parameters for 
large networks due to its compatitive advantage over or- 
dinary diagonalization. 
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FIG. 1: (color on line) Plot of the spectral gap Ai — A2 
vs ft for networks of 10 communities with 10 vertices each 
(the probability for an edge to be in a community is pi — 
0.3 while outside of the community it is p — 0.05). Solid 
points represent the solutions computed via diagonalization, 
while lines report the value obtained through integration of 
PEM. Different bias choice have been tested. Circles (blue) 
are related to degree-based strategy, square (red) are related 
to clustering-based strategies, diamonds (green) multiplicity- 
based strategies. The physical quantities to get the variable 
x in Eq. @ in these strategies have been normalized with 
respect to their maximum value. 



SPECTRAL GAP 

A key variable in the spectral theory of graphs is the 
spectral gap [i = (Ai — A2), i.e. the difference between 
first unitary and the second eigenvalues. The spectral 
gap measures how fast the information on the RW ini- 
tial distribution is destroyed and the stationary distri- 
bution is approached. The characteristic time for that 
is t = -l/ln[(l - fj,)] ~ l/n [HI. We show in Fig. 1 
the dependence of spectral gap of simulated graphs with 
communities for different strategies (degree, clustering 
and multiplicity based) at a given value of parameter /3. 
In all investigated cases the spectral gap has its well de- 
fined maximum, i.e. the value of parameter (3 for which 
the random walker converges to stationary distribution 
with the largest rate. 

The condition of maximal spectral gap implies that it 
is a stationary point for the function A2(/3), i.e. that its 
first order perturbation coefficient vanishes at this point: 



= 



(v2((3 m )\ dT d ^ m) \v 2 (f3 m )) 

(v 2 Wm)\[x-x(P m )]\v 2 (P m )}, (16) 



where x and x arc defined above. The squares of entries 
c|j(/3) of the vector |i> 2 (/3)) in the chosen basis \<pi) = |e), 
define a particular measure on the graph. Equation (|16[) 
can be written as 



J2i c 2 i{flm){xi — Xi(fim)) = 0. Thus we conclude that 
the local spectral maximum is achieved if the average 
difference between property Xi and its expectation Xj, 
with respect to this measure, in the neighborhood of 
vertex i vanishes. We have studied behavior of spec- 
tral gap for different sets of real and simulated networks 
(Barabasi- Albert model with different range of param- 
eters, Erdos-Renyi model and random netwroks with 
given community structure) and three different strategies 
(degree-based, clustering-based and multiplicity-based). 
Although in general it is not clear that the local maxi- 
mum of spectral gap is unique, we have found only one 
maximum in all the studied networks. This observation 
is interesting because for all cases the shapes of spectral 
gap vs. (3 looks typically Gaussian-like. In both limits 
/3 — > ±00 the spectral gap of heterogeneous network is 
indeed typically zero, as the RW stays in the vicinity of 
the vertices with maximal or minimal value of studied 
property Xi. 



RANDOM WALK INVARIANT 

A fundamental question in the theory of complex net- 
works is how topology affects dynamics on networks. Our 
choice of /3-parametrized biases provides a useful tool to 
investigate this relationship. A central issue is, for in- 
stance, given by the search of properties of the transition 
matrix T which are independent of j3 and the chosen 
bias, but depend only on the topology of the network. 
An important example comes from the analysis the de- 
terminant of T as a function of the bias parameters: 



d/3 



N 



r)T s 



For vertex centered bias using eq. ffT2]) we have 



dp 



(17) 



N N 

U— 1 

d In Zi 



and using the diagonality of the x and xa{fi) = 
H A M (/3)z M Q8) A„(A))z M (A)) 

11 P /3x u 11 p p 0Xll ■ y w > 



In other words the quantity U^Li IS a topo- 

logical constant which does not depend on the choice of 
parameters. For j3 — we get FJ[u=i = cons t an d it 
follows that this quantity does not depend on the choice 
of vertex biases Xi either. It can be shown that such quan- 
tity coincides with the determinant of adjacency matrix 
which must be conserved for all processes. 
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FIG. 2: (color on line) The ratio between expected value of 
multiplicity for edges that are connecting vertices in different 
communities to the expected value of multiplicity for edges 
that are connecting vertices in the same community with re- 
spect to parameter fi. 




0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 
I 



FIG. 3: (color on line) Histogram of second, third, fourth 
and fifth eigenvalue of non-biased RW for 1000 GN networks 
with parameters N = 128, n — 32, pi n — 0.35, p ou t = 0.05. 
There is a clear gap between "community" band and the rest 
of the eigenvalues. 



COMMUNITY FINDING 

There are many competing; algorithms and methods for 
community detection [26| . Despite a significant scientific 
effort to find such reliable algorithms, there is not yet 
agreement on a single general solving algorithm for the 
various cases. In this section instead of adding another 
precise recipe, we want to suggest a general methodology 
based on TBRW which could be used for community de- 
tection algorithms. To add trouble, the very definition of 
communities is not a solid one. In most of the cases we 
define communities as connected subgraphs whose den- 
sity of edges is larger within the proposed community 
than outside it (a concept quantified by modularity [14| 
)■ 

Scientific community is therefore thriving to find a 
benchmark in order to assess the success of various meth- 
ods. One approach is to create synthetic graphs with 
assigned community structure (benchmark algorithms) 
and test through them the community detection recipes 
27 1 . The Girvan- Newman (G N) fl3 | and Lancichinetti- 
Fortunato-Radicchi (LFR) [28|, [29( are the most common 
benchmark algorithms. In both these models several 
topological properties (not only edge density) are un- 
evenly distributed within the same community and be- 
tween different ones. We use this property to propose 
a novel methodology creating suitable TBRW for com- 
munity detection. The difference between internal and 
external part of a community is related to the "physical" 
meaning of the graph. In many real processes the estab- 
lishment of a community is facilitated by the subgraph 
structure. For instance in social networks agents have a 
higher probability of communication when they share a 



lot of friends. We test our approach on GN benchmark 
since in this case we can easily compute the expected dif- 
ferences between the frequency of biased variables within 
and outside the community. 

In this section we will describe how to use TBRW for 
community detection. For j3 = our method is rather 
similar to the one introduced by Donetti and Munoz 
[3fjj ]. The most notable difference is that we consider 
the spectral properties of transition matrix instead of the 
Laplacian one. We decide if a vertex belongs to a com- 
munity according to the following ideas: (i) We expect 
that the vertices belonging to the same community to 
have similar values of eigenvectors components; (ii) we 
expect relevant eigenvectors to have the largest eigen- 
values. Indeed, spectral gap is associated with temporal 
convergence of random walker fluctuations to the ergodic 
stationary state. If the network has well defined com- 
munities, we expect the random walker to spend some 
time in the community rather than escaping immedi- 
ately out of it. Therefore the speed of convergence to 
the ergodic state should be related to the community 
structure. Therefore eigenvectors associated with largest 
eigenvalues (except for the maximal eigenvalue 1) should 
be correlated with community structure. Coming back to 
the above mentioned Donetti and Munoz approach here 
we use the fact that some vertex properties will be more 
common inside a community and less frequent between 
different communities. We then vary the bias parame- 
ters trying both to shrink the spectral gap in transition 
matrix and to maximize the separation between relevant 
eigenvalues and the rest of the spectra. 

For example in the case of GN benchmark the network 
consist of 4 communities each with n — 32 vertices i.e. 
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FIG. 4: (color on line) Contour plot of the difference between 
fourth and fifth eigenvalue A4 — A5 as a function of parame- 
ter Pk which biases RW according to degrees of the vertices 
and parameter /3m which bias RW according to multiplicities 
of the edges. Both degrees and multiplicity values are nor- 
malized with respect to the maximal degree and multiplicity 
(therefore the largest value is one). 



N = 128 vertices all together. The probability that the 
two vertices which belong to the same community are 
connected is p, n . The probability that the two vertices 
which belong to different communities are connected is 
Pout- The fundamental parameter 26[ which character- 
izes the difficulty of detecting the structure is 



/' 



fcout H - kin 



(20) 



where k ou t = p ut (N — n) is the mean degree related to 
inter-community connections and ki n = Pi n (n — 1) is the 
mean degree related to edges inside-community. As a 
rule of thumb we can expect to find well defined com- 
munities when /i < 1/2, and observe some signature of 
communities even when /1 < 3/4 [27]. The probabilities 
Pin and p ou t are related via the control parameter fi as 

(n— l)u 

P° ut ~ (N-n)(l~u)P in - 

We now examine the edge multiplicity. The latter 
is defined as the number of common neighbors shared 
by neighbouring vertices. The expected multiplicity of 
an edge connecting vertices inter-community and inside- 
communities are respectively 

E(M out ) = 2p mPo ut{n-l)+p 2 out (N -2n), 
E(M in ) = p 2 in (n-2)+p 2 out (N-n) (21) 

On Fig. [2] we plot the ratio of the quantitites above de- 
fined, E(M ou t)/E(M in ), vs. the parameter fi. 

We see that even for fj, > 0.5 the ratio remains smaller 
than 1 implying that the multiplicity is more common in 
the edges in the same community. Based on this analy- 
sis for this particular example we expect that if we want 



to find well-defined communities via TBRW we have to 
increase bias with respect to the multiplicity. Through 
numerical simulations we find that the number of com- 
munities is related to number of eigenvalues in the "com- 
munity band" . Namely one in general observes a gap be- 
tween eigenvalues A2, Xn/u-i an d the next eigenvalue 
evident in a network with a strong community structure 
(fi -C 1/2). The explanation that we give for that phe- 
nomenon can be expressed by considering a network of 
n separated graphs. For such a network there are n de- 
generate eignevalues Ai . . . A„ = 1. If we now start to 
connect these graphs with very few edges such a degen- 
eracy is broken with the largest eigenvalue remaining 1 
while the next (n— 1) eigenvalues staying close to it. The 
distance between any two of this set of (n — 1) eigenval- 
ues will be smaller than the gap between this community 
band and the rest of the eigenvalues in the spectrum. 
Therefore, the number of eigenvalues different of 1 which 
are forming this "community band" is always equal to the 
number of communities minus one, at least for different 
GN-type networks with different number of communities 
and different sizes, as long as /1 <C 1/2. For example in 
the case of 1000 GN networks described with parameters 
N = 128, n = 32, p in = 0.35, p out = 0.05, i.e. fi = 0.125, 
the histograms of eigenvalues are depicted on figure |3l 



For our purposes we used two parameters biased RW, 
in which topological properties are Xi = ki/max(k) i.e. 
the normalized degree (with respect to maximal degree in 
the network) and = My / 'max(Mij) i.e. the normal- 
ized multiplicity (with respect to maximal multiplicity in 
the network). We choosed GN network whose parame- 
ters are N — 128, n — 32, p in — 16/62 and p out = 1/12, 
for which fj, = 1/2. Being N/n = 4 the number of com- 
munities, as a criterion for good choice of parameters we 
decided to use the difference between A4 and A5, i.e., we 
decided to maximize the gap between "community band" 
and the rest of eigenvalues; checking at the same time 
that the spectral gap shrinks. In Fig. [4j we plot such a 
quantity with respect to different biases. 

It is important to mention that for every single network 
instance there are different optimal parameters. This can 
be seen on figure[51 where we show the difference between 
unbiased and biased eigenvalues for 1000 GN nets created 
with same parameters. As shown in the figure the differ- 
ence between fourth and fifth eigenvalue is now not nec- 
essarily the optimal for this choice of parameters. Every 
realization of the network should be independently ana- 
lyzed and its own parameters should be carefully chosen. 
In the Figs. [6] and [7] we present instead the difference be- 
tween unbiased and biased projection on three eigenvec- 
tors with largest nontrivial eigenvalues. Using 3D view 
it is easy to check that communities are better separated 
in the biased case then in the non-biased case. 
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FIG. 5: (color on line) Histograms of A2, A3, A4 and A5 
for 1000 GN networks described with parameters N = 128, 
n — 32, pin = 16/62 and p ut = 1/12. With black colour 
we indicate the eigenvalues of nonbiased RW, while with red 
we indicate the eigenvalues of RW biased with parameters 
f3k = —2.5 and /3m = 4.3. Note how this choice of parame- 
ters does not maximize "community gap" for all the different 
realizations of monitored GN network. 



FIG. 7: (color on line) Plot of the eigenvector components of 
the second, third and fourth eigenvector of biased RW with 
parameters /3fc = —2.5 and /3a/ = 4.3. Different markers rep- 
resent four different predefined communities. This is an ex- 
ample of the same GN graph realization with pi n = 16/62 and 
Pout ~ 1/12 as the one on the previous figure. For this choice 
of parameters /j, = 1/2. One can notice tetrahedral distribu- 
tion of vertices in which vertices from the same community 
belong to the same branch of tetrahedron. 
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FIG. 6: (color on line) Plot of the eigenvector components of 
the second, third and fourth eigenvector. Different markers 
represent four different predefined communities. This is an 
example of GN graph with pi n = 16/62 and p ou t = 1/12. 
For this choice of parameters /1 = 1/2. There is a strong 
dispersion between different vertices which belong to the same 
community. 

CONCLUSION 

In this paper we presented a detailed theoretical frame- 
work to analyze the evolution of TBRW on a graph. By 
using as bias some topological property of the graph itself 
allows to use the RW as a tool to explore the environ- 
ment. This method maps vertices of the graph to dif- 



ferent points in the iV-dimensional Euclidean space nat- 
urally associated with the given graph. In this way we 
can measure distances between vertices depending on the 
chosen bias strategy and bias parameters. In particular 
we developed a perturbative approach to the spectrum of 
eigenvalues and eigenvectors associated with the transi- 
tion matrix of the system. More generally we generalized 
the quantum PEM approach to the present case. This 
led naturally to study the behavior of the gap between 
the largest and the second eigenvalue of the spectrum 
characterizing the relaxation to the stationary Markov 
state. In numerical applications of such a theoretical 
framework we have observed a unimodal shape of the 
spectral gap vs. the bias parameter which is not an obvi- 
ous feature of the studied processes. We have finally out- 
lined a very promising application of topologically biased 
random walks to the fundamental problem of community 
finding. We described the basic ideas and proposed some 
criteria for the choice of parameters, by considering the 
particular case of GN graphs. We are working further 
in direction of this application, but the number of pos- 
sible strategies (different topological properties we can 
use for biasing) and types of networks is just too large to 
be presented in one paper. Furthermore, since in many 
dynamical systems as the WWW or biological networks, 
a feedback between function and form (topology) is ev- 
ident, our framework may be a useful way to describe 
mathematically such an observed mechanism. In the case 
of biology, for instance, the shape of the metabolic net- 
works can be triggered not only by the chemical proper- 
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ties of the compounds, but also by the possibility of the 
metabolites to interact. Biased RW can be therefore the 
mechanism through which a network attains a particular 
form for a given function. By introducing such approach 
we can now address the problem of community detec- 
tion in the graph. This the reason why here we have not 
introduced another precise method for community detec- 
tion, but rather a possible framework to create different 
community finding methods with different ad hoc strate- 
gies. Indeed in real situations we expect different types 
of network to be efficiently explored by use of different 
topological properties. This explains why we believe that 
TBRW could play a role in community detection prob- 
lems, and we hope to stimulate further developments, 
in the network scientific community, of this promising 
methodology. 
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